Dataset statistics
| Number of variables | 29 |
|---|---|
| Number of observations | 1784 |
| Missing cells | 2 |
| Missing cells (%) | < 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 172.6 KiB |
| Average record size in memory | 99.1 B |
Variable types
| BOOL | 19 |
|---|---|
| NUM | 8 |
| CAT | 2 |
suburb has a high cardinality: 256 distinct values | High cardinality |
parking is highly skewed (γ1 = 20.56170848) | Skewed |
Unnamed: 0 has unique values | Unique |
ID has unique values | Unique |
bedroom has 145 (8.1%) zeros | Zeros |
bathroom has 114 (6.4%) zeros | Zeros |
garage has 826 (46.3%) zeros | Zeros |
parking has 1318 (73.9%) zeros | Zeros |
Reproduction
| Analysis started | 2020-09-25 11:36:13.811550 |
|---|---|
| Analysis finished | 2020-09-25 11:36:28.571168 |
| Duration | 14.76 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 1784 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 891.5 |
|---|---|
| Minimum | 0 |
| Maximum | 1783 |
| Zeros | 1 |
| Zeros (%) | 0.1% |
| Memory size | 13.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 89.15 |
| Q1 | 445.75 |
| median | 891.5 |
| Q3 | 1337.25 |
| 95-th percentile | 1693.85 |
| Maximum | 1783 |
| Range | 1783 |
| Interquartile range (IQR) | 891.5 |
Descriptive statistics
| Standard deviation | 515.1407575 |
|---|---|
| Coefficient of variation (CV) | 0.577835959 |
| Kurtosis | -1.2 |
| Mean | 891.5 |
| Median Absolute Deviation (MAD) | 446 |
| Skewness | 0 |
| Sum | 1590436 |
| Variance | 265370 |
| Monotocity | Strictly increasing |
| Value | Count | Frequency (%) | |
| 1783 | 1 | 0.1% | |
| 1196 | 1 | 0.1% | |
| 1174 | 1 | 0.1% | |
| 1176 | 1 | 0.1% | |
| 1178 | 1 | 0.1% | |
| 1180 | 1 | 0.1% | |
| 1182 | 1 | 0.1% | |
| 1184 | 1 | 0.1% | |
| 1186 | 1 | 0.1% | |
| 1188 | 1 | 0.1% | |
| Other values (1774) | 1774 | 99.4% |
| Value | Count | Frequency (%) | |
| 0 | 1 | 0.1% | |
| 1 | 1 | 0.1% | |
| 2 | 1 | 0.1% | |
| 3 | 1 | 0.1% | |
| 4 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 1783 | 1 | 0.1% | |
| 1782 | 1 | 0.1% | |
| 1781 | 1 | 0.1% | |
| 1780 | 1 | 0.1% | |
| 1779 | 1 | 0.1% |
| Distinct | 256 |
|---|---|
| Distinct (%) | 14.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 13.9 KiB |
| Sea Po | 80 |
|---|---|
| Plattekloof | 63 |
| Camps Bay | 58 |
| Constantia Upper | 55 |
| Baronetcy Estate | 50 |
| Other values (251) |
| Value | Count | Frequency (%) | |
| Sea Po | 80 | 4.5% | |
| Plattekloof | 63 | 3.5% | |
| Camps Bay | 58 | 3.3% | |
| Constantia Upper | 55 | 3.1% | |
| Baronetcy Estate | 50 | 2.8% | |
| Claremont Upper | 50 | 2.8% | |
| Foreshore | 46 | 2.6% | |
| Big Bay | 44 | 2.5% | |
| ondebosch | 42 | 2.4% | |
| Cape Town Central | 41 | 2.3% | |
| Other values (246) | 1255 | 70.3% |
Unique
| Unique | 95 ? |
|---|---|
| Unique (%) | 5.3% |
Length
| Max length | 45 |
|---|---|
| Median length | 12 |
| Mean length | 13.40919283 |
| Min length | 2 |
| Distinct | 14 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.942825112 |
|---|---|
| Minimum | 0 |
| Maximum | 13 |
| Zeros | 145 |
| Zeros (%) | 8.1% |
| Memory size | 13.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 13 |
| Range | 13 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.681306305 |
|---|---|
| Coefficient of variation (CV) | 0.571323895 |
| Kurtosis | 2.189194335 |
| Mean | 2.942825112 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.7259493596 |
| Sum | 5250 |
| Variance | 2.826790893 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 3 | 456 | 25.6% | |
| 2 | 444 | 24.9% | |
| 4 | 337 | 18.9% | |
| 5 | 167 | 9.4% | |
| 0 | 145 | 8.1% | |
| 1 | 139 | 7.8% | |
| 6 | 50 | 2.8% | |
| 7 | 22 | 1.2% | |
| 8 | 12 | 0.7% | |
| 10 | 5 | 0.3% | |
| Other values (4) | 7 | 0.4% |
| Value | Count | Frequency (%) | |
| 0 | 145 | 8.1% | |
| 1 | 139 | 7.8% | |
| 2 | 444 | 24.9% | |
| 3 | 456 | 25.6% | |
| 4 | 337 | 18.9% |
| Value | Count | Frequency (%) | |
| 13 | 1 | 0.1% | |
| 12 | 1 | 0.1% | |
| 11 | 1 | 0.1% | |
| 10 | 5 | 0.3% | |
| 9 | 4 | 0.2% |
| Distinct | 19 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.337163677 |
|---|---|
| Minimum | 0 |
| Maximum | 13 |
| Zeros | 114 |
| Zeros (%) | 6.4% |
| Memory size | 13.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 13 |
| Range | 13 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.540069786 |
|---|---|
| Coefficient of variation (CV) | 0.6589481948 |
| Kurtosis | 2.914619264 |
| Mean | 2.337163677 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.238854569 |
| Sum | 4169.5 |
| Variance | 2.371814946 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 2 | 529 | 29.7% | |
| 1 | 430 | 24.1% | |
| 3 | 225 | 12.6% | |
| 4 | 126 | 7.1% | |
| 0 | 114 | 6.4% | |
| 2.5 | 88 | 4.9% | |
| 5 | 79 | 4.4% | |
| 3.5 | 56 | 3.1% | |
| 4.5 | 37 | 2.1% | |
| 1.5 | 28 | 1.6% | |
| Other values (9) | 72 | 4.0% |
| Value | Count | Frequency (%) | |
| 0 | 114 | 6.4% | |
| 1 | 430 | 24.1% | |
| 1.5 | 28 | 1.6% | |
| 2 | 529 | 29.7% | |
| 2.5 | 88 | 4.9% |
| Value | Count | Frequency (%) | |
| 13 | 1 | 0.1% | |
| 10 | 2 | 0.1% | |
| 9 | 4 | 0.2% | |
| 8 | 4 | 0.2% | |
| 7.5 | 4 | 0.2% |
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.117152466 |
|---|---|
| Minimum | 0 |
| Maximum | 12 |
| Zeros | 826 |
| Zeros (%) | 46.3% |
| Memory size | 13.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 12 |
| Range | 12 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.302423457 |
|---|---|
| Coefficient of variation (CV) | 1.165842171 |
| Kurtosis | 4.706449638 |
| Mean | 1.117152466 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.473306876 |
| Sum | 1993 |
| Variance | 1.696306862 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 826 | 46.3% | |
| 2 | 500 | 28.0% | |
| 1 | 261 | 14.6% | |
| 3 | 108 | 6.1% | |
| 4 | 65 | 3.6% | |
| 6 | 11 | 0.6% | |
| 5 | 9 | 0.5% | |
| 12 | 1 | 0.1% | |
| 10 | 1 | 0.1% | |
| 8 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 826 | 46.3% | |
| 1 | 261 | 14.6% | |
| 2 | 500 | 28.0% | |
| 3 | 108 | 6.1% | |
| 4 | 65 | 3.6% |
| Value | Count | Frequency (%) | |
| 12 | 1 | 0.1% | |
| 10 | 1 | 0.1% | |
| 8 | 1 | 0.1% | |
| 7 | 1 | 0.1% | |
| 6 | 11 | 0.6% |
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5168161435 |
|---|---|
| Minimum | 0 |
| Maximum | 60 |
| Zeros | 1318 |
| Zeros (%) | 73.9% |
| Memory size | 13.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 60 |
| Range | 60 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.829142709 |
|---|---|
| Coefficient of variation (CV) | 3.539252269 |
| Kurtosis | 632.7535455 |
| Mean | 0.5168161435 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 20.56170848 |
| Sum | 922 |
| Variance | 3.345763049 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 1318 | 73.9% | |
| 1 | 257 | 14.4% | |
| 2 | 143 | 8.0% | |
| 3 | 27 | 1.5% | |
| 4 | 18 | 1.0% | |
| 8 | 7 | 0.4% | |
| 6 | 6 | 0.3% | |
| 10 | 4 | 0.2% | |
| 60 | 1 | 0.1% | |
| 15 | 1 | 0.1% | |
| Other values (2) | 2 | 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 1318 | 73.9% | |
| 1 | 257 | 14.4% | |
| 2 | 143 | 8.0% | |
| 3 | 27 | 1.5% | |
| 4 | 18 | 1.0% |
| Value | Count | Frequency (%) | |
| 60 | 1 | 0.1% | |
| 15 | 1 | 0.1% | |
| 12 | 1 | 0.1% | |
| 10 | 4 | 0.2% | |
| 8 | 7 | 0.4% |
erfSize
Real number (ℝ≥0)
| Distinct | 764 |
|---|---|
| Distinct (%) | 42.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 970.2300448 |
|---|---|
| Minimum | 38 |
| Maximum | 32397 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 13.9 KiB |
Quantile statistics
| Minimum | 38 |
|---|---|
| 5-th percentile | 190.6 |
| Q1 | 595 |
| median | 715 |
| Q3 | 845.25 |
| 95-th percentile | 2044.1 |
| Maximum | 32397 |
| Range | 32359 |
| Interquartile range (IQR) | 250.25 |
Descriptive statistics
| Standard deviation | 1732.925427 |
|---|---|
| Coefficient of variation (CV) | 1.786097469 |
| Kurtosis | 144.9050997 |
| Mean | 970.2300448 |
| Median Absolute Deviation (MAD) | 120 |
| Skewness | 10.76837172 |
| Sum | 1730890.4 |
| Variance | 3003030.537 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 715 | 699 | 39.2% | |
| 496 | 13 | 0.7% | |
| 495 | 9 | 0.5% | |
| 595 | 8 | 0.4% | |
| 180 | 8 | 0.4% | |
| 1000 | 7 | 0.4% | |
| 1200 | 5 | 0.3% | |
| 160 | 5 | 0.3% | |
| 1004 | 4 | 0.2% | |
| 652 | 4 | 0.2% | |
| Other values (754) | 1022 | 57.3% |
| Value | Count | Frequency (%) | |
| 38 | 1 | 0.1% | |
| 43 | 1 | 0.1% | |
| 58 | 1 | 0.1% | |
| 80 | 1 | 0.1% | |
| 90 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 32397 | 1 | 0.1% | |
| 24661 | 1 | 0.1% | |
| 24006 | 2 | 0.1% | |
| 22098 | 1 | 0.1% | |
| 21188 | 1 | 0.1% |
buildingSize
Real number (ℝ≥0)
| Distinct | 397 |
|---|---|
| Distinct (%) | 22.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 198.6199552 |
|---|---|
| Minimum | 14 |
| Maximum | 2400 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 13.9 KiB |
Quantile statistics
| Minimum | 14 |
|---|---|
| 5-th percentile | 50 |
| Q1 | 100.75 |
| median | 145 |
| Q3 | 206.5 |
| 95-th percentile | 600 |
| Maximum | 2400 |
| Range | 2386 |
| Interquartile range (IQR) | 105.75 |
Descriptive statistics
| Standard deviation | 180.8375422 |
|---|---|
| Coefficient of variation (CV) | 0.9104701593 |
| Kurtosis | 17.4043728 |
| Mean | 198.6199552 |
| Median Absolute Deviation (MAD) | 49.5 |
| Skewness | 3.115866589 |
| Sum | 354338 |
| Variance | 32702.21667 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 145 | 571 | 32.0% | |
| 55 | 20 | 1.1% | |
| 400 | 15 | 0.8% | |
| 78 | 14 | 0.8% | |
| 300 | 14 | 0.8% | |
| 60 | 13 | 0.7% | |
| 45 | 12 | 0.7% | |
| 81 | 12 | 0.7% | |
| 200 | 11 | 0.6% | |
| 72 | 11 | 0.6% | |
| Other values (387) | 1091 | 61.2% |
| Value | Count | Frequency (%) | |
| 14 | 1 | 0.1% | |
| 20 | 1 | 0.1% | |
| 22 | 1 | 0.1% | |
| 26 | 1 | 0.1% | |
| 27 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 2400 | 1 | 0.1% | |
| 1186 | 1 | 0.1% | |
| 1100 | 3 | 0.2% | |
| 1091 | 1 | 0.1% | |
| 1080 | 1 | 0.1% |
| Distinct | 1784 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 13.9 KiB |
| H_1135 | 1 |
|---|---|
| H_1405 | 1 |
| H_373 | 1 |
| H_8 | 1 |
| H_1400 | 1 |
| Other values (1779) |
| Value | Count | Frequency (%) | |
| H_1135 | 1 | 0.1% | |
| H_1405 | 1 | 0.1% | |
| H_373 | 1 | 0.1% | |
| H_8 | 1 | 0.1% | |
| H_1400 | 1 | 0.1% | |
| H_147 | 1 | 0.1% | |
| H_1070 | 1 | 0.1% | |
| H_1118 | 1 | 0.1% | |
| H_1591 | 1 | 0.1% | |
| H_1628 | 1 | 0.1% | |
| Other values (1774) | 1774 | 99.4% |
Unique
| Unique | 1784 ? |
|---|---|
| Unique (%) | 100.0% |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 5.379484305 |
| Min length | 3 |
price
Real number (ℝ≥0)
| Distinct | 587 |
|---|---|
| Distinct (%) | 32.9% |
| Missing | 2 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6732706.833 |
|---|---|
| Minimum | 269000 |
| Maximum | 49990000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 13.9 KiB |
Quantile statistics
| Minimum | 269000 |
|---|---|
| 5-th percentile | 965250 |
| Q1 | 2252750 |
| median | 3950000 |
| Q3 | 7950000 |
| 95-th percentile | 24688500 |
| Maximum | 49990000 |
| Range | 49721000 |
| Interquartile range (IQR) | 5697250 |
Descriptive statistics
| Standard deviation | 7597931.111 |
|---|---|
| Coefficient of variation (CV) | 1.128510612 |
| Kurtosis | 6.925952712 |
| Mean | 6732706.833 |
| Median Absolute Deviation (MAD) | 2155000 |
| Skewness | 2.481251198 |
| Sum | 1.199768358e+10 |
| Variance | 5.772855716e+13 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 3995000 | 21 | 1.2% | |
| 2950000 | 18 | 1.0% | |
| 4950000 | 18 | 1.0% | |
| 2850000 | 17 | 1.0% | |
| 2495000 | 17 | 1.0% | |
| 1295000 | 17 | 1.0% | |
| 2750000 | 16 | 0.9% | |
| 2995000 | 16 | 0.9% | |
| 1350000 | 15 | 0.8% | |
| 3200000 | 14 | 0.8% | |
| Other values (577) | 1613 | 90.4% |
| Value | Count | Frequency (%) | |
| 269000 | 1 | 0.1% | |
| 320000 | 1 | 0.1% | |
| 350000 | 1 | 0.1% | |
| 400000 | 1 | 0.1% | |
| 475000 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 49990000 | 1 | 0.1% | |
| 49000000 | 2 | 0.1% | |
| 47500000 | 1 | 0.1% | |
| 45000000 | 3 | 0.2% | |
| 43500000 | 1 | 0.1% |
propertyType_Apartment
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 0 | |
|---|---|
| 1 | 19 |
| Value | Count | Frequency (%) | |
| 0 | 1765 | 98.9% | |
| 1 | 19 | 1.1% |
propertyType_Guesthouse
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 0 | |
|---|---|
| 1 | 1 |
| Value | Count | Frequency (%) | |
| 0 | 1783 | 99.9% | |
| 1 | 1 | 0.1% |
propertyType_House
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 0 | |
|---|---|
| 1 | 4 |
| Value | Count | Frequency (%) | |
| 0 | 1780 | 99.8% | |
| 1 | 4 | 0.2% |
propertyType_Park
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 0 | |
|---|---|
| 1 | 1 |
| Value | Count | Frequency (%) | |
| 0 | 1783 | 99.9% | |
| 1 | 1 | 0.1% |
propertyType_apartment
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 1147 | 64.3% | |
| 1 | 637 | 35.7% |
propertyType_auction
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 0 | |
|---|---|
| 1 | 4 |
| Value | Count | Frequency (%) | |
| 0 | 1780 | 99.8% | |
| 1 | 4 | 0.2% |
propertyType_breakfast
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 0 | |
|---|---|
| 1 | 1 |
| Value | Count | Frequency (%) | |
| 0 | 1783 | 99.9% | |
| 1 | 1 | 0.1% |
propertyType_bus
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 0 | |
|---|---|
| 1 | 6 |
| Value | Count | Frequency (%) | |
| 0 | 1778 | 99.7% | |
| 1 | 6 | 0.3% |
propertyType_cottage
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 0 | |
|---|---|
| 1 | 4 |
| Value | Count | Frequency (%) | |
| 0 | 1780 | 99.8% | |
| 1 | 4 | 0.2% |
propertyType_farm
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 0 | |
|---|---|
| 1 | 1 |
| Value | Count | Frequency (%) | |
| 0 | 1783 | 99.9% | |
| 1 | 1 | 0.1% |
propertyType_flats
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 0 | |
|---|---|
| 1 | 1 |
| Value | Count | Frequency (%) | |
| 0 | 1783 | 99.9% | |
| 1 | 1 | 0.1% |
propertyType_guesthouse
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 0 | |
|---|---|
| 1 | 3 |
| Value | Count | Frequency (%) | |
| 0 | 1781 | 99.8% | |
| 1 | 3 | 0.2% |
propertyType_home
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 0 | |
|---|---|
| 1 | 12 |
| Value | Count | Frequency (%) | |
| 0 | 1772 | 99.3% | |
| 1 | 12 | 0.7% |
propertyType_house
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 1 | |
|---|---|
| 0 |
| Value | Count | Frequency (%) | |
| 1 | 959 | 53.8% | |
| 0 | 825 | 46.2% |
propertyType_land
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 0 | |
|---|---|
| 1 | 83 |
| Value | Count | Frequency (%) | |
| 0 | 1701 | 95.3% | |
| 1 | 83 | 4.7% |
propertyType_loft
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 0 | |
|---|---|
| 1 | 2 |
| Value | Count | Frequency (%) | |
| 0 | 1782 | 99.9% | |
| 1 | 2 | 0.1% |
propertyType_office
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 0 | |
|---|---|
| 1 | 4 |
| Value | Count | Frequency (%) | |
| 0 | 1780 | 99.8% | |
| 1 | 4 | 0.2% |
propertyType_property
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 0 | |
|---|---|
| 1 | 2 |
| Value | Count | Frequency (%) | |
| 0 | 1782 | 99.9% | |
| 1 | 2 | 0.1% |
propertyType_townhouse
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 0 | |
|---|---|
| 1 | 40 |
| Value | Count | Frequency (%) | |
| 0 | 1744 | 97.8% | |
| 1 | 40 | 2.2% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| Unnamed: 0 | suburb | bedroom | bathroom | garage | parking | erfSize | buildingSize | ID | price | propertyType_Apartment | propertyType_Guesthouse | propertyType_House | propertyType_Park | propertyType_apartment | propertyType_auction | propertyType_breakfast | propertyType_bus | propertyType_cottage | propertyType_farm | propertyType_flats | propertyType_guesthouse | propertyType_home | propertyType_house | propertyType_land | propertyType_loft | propertyType_office | propertyType_property | propertyType_townhouse | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | Clifton | 3 | 3.0 | 0 | 0 | 715.0 | 310.0 | H_1 | 49990000.0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 1 | 1 | Constantia Upper | 7 | 7.0 | 3 | 0 | 7555.0 | 145.0 | H_2 | 49000000.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| 2 | 2 | Bantry Bay | 3 | 3.5 | 2 | 0 | 626.0 | 145.0 | H_3 | 49000000.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| 3 | 3 | Fresnaye | 5 | 5.0 | 4 | 4 | 1044.0 | 900.0 | H_4 | 47500000.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| 4 | 4 | Bantry Bay | 3 | 3.0 | 2 | 0 | 715.0 | 546.0 | H_5 | 45000000.0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 5 | 5 | Waterfront (Cape Town) | 3 | 3.5 | 0 | 3 | 715.0 | 491.0 | H_6 | 45000000.0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 6 | 6 | Mouille Po | 4 | 2.0 | 2 | 1 | 715.0 | 261.0 | H_7 | 45000000.0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 7 | 7 | Constantia Upper | 5 | 4.0 | 3 | 0 | 4215.0 | 530.0 | H_8 | 43500000.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| 8 | 8 | Constantia Upper | 5 | 7.0 | 4 | 10 | 8210.0 | 145.0 | H_9 | 42000000.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| 9 | 9 | Waterfront (Cape Town) | 3 | 3.0 | 0 | 0 | 715.0 | 216.0 | H_10 | 41400000.0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Last rows
| Unnamed: 0 | suburb | bedroom | bathroom | garage | parking | erfSize | buildingSize | ID | price | propertyType_Apartment | propertyType_Guesthouse | propertyType_House | propertyType_Park | propertyType_apartment | propertyType_auction | propertyType_breakfast | propertyType_bus | propertyType_cottage | propertyType_farm | propertyType_flats | propertyType_guesthouse | propertyType_home | propertyType_house | propertyType_land | propertyType_loft | propertyType_office | propertyType_property | propertyType_townhouse | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1774 | 1774 | Woodlands (Mitchells Pla | 2 | 2.0 | 0 | 0 | 90.0 | 145.0 | H_1775 | 500000.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| 1775 | 1775 | Kalkfonte | 0 | 0.0 | 0 | 0 | 160.0 | 145.0 | H_1776 | 499000.0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 1776 | 1776 | Khayelitsha | 3 | 1.0 | 1 | 0 | 141.0 | 145.0 | H_1777 | 495000.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| 1777 | 1777 | Tafelsig | 3 | 1.0 | 0 | 0 | 144.0 | 145.0 | H_1778 | 480000.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| 1778 | 1778 | Eastridge | 3 | 1.0 | 0 | 0 | 280.0 | 145.0 | H_1779 | 480000.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| 1779 | 1779 | Maitland | 1 | 1.0 | 0 | 1 | 715.0 | 26.0 | H_1780 | 475000.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| 1780 | 1780 | Kle | 0 | 0.0 | 0 | 0 | 429.0 | 145.0 | H_1781 | 400000.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 |
| 1781 | 1781 | Mandela Park | 2 | 1.0 | 0 | 0 | 108.0 | 145.0 | H_1782 | 350000.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| 1782 | 1782 | Khayelitsha | 2 | 1.0 | 0 | 0 | 99.0 | 145.0 | H_1783 | 320000.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| 1783 | 1783 | Mitchells Pla | 0 | 0.0 | 0 | 0 | 715.0 | 66.0 | H_1784 | 269000.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |